Kernelized Rényi distance for speaker recognition
نویسندگان
چکیده
Speaker recognition systems classify a test signal as a speaker or an imposter by evaluating a matching score between input and reference signals. We propose a new information theoretic approach for computation of the matching score using the Rényi entropy. The proposed entropic distance, the Kernelized Rényi distance (KRD), is formulated in a non-parametric way and the resulting measure is efficiently evaluated in a parallelized fashion on a graphical processor. The distance is then adapted as a scoring function and its performance compared with other popular scoring approaches in a speaker identification and speaker verification framework.
منابع مشابه
Kernelized Rényi distance for subset selection and similarity scoring
Rényi entropy refers to a generalized class of entropies that have been used in several applications. In this work, we derive a non-parametric distance between distributions based on the quadratic Rényi entropy. The distributions are estimated via Parzen density estimates. The quadratic complexity of the distance evaluation is mitigated with GPUbased parallelization. This results in an efficien...
متن کاملEfficient subset selection via the kernelized Rényi distance
With improved sensors, the amount of data available in many vision problems has increased dramatically and allows the use of sophisticated learning algorithms to perform inference on the data. However, since these algorithms scale with data size, pruning the data is sometimes necessary. The pruning procedure must be statistically valid and a representative subset of the data must be selected wi...
متن کاملTitle of dissertation : SCALABLE LEARNING FOR GEOSTATISTICS AND SPEAKER RECOGNITION Balaji Vasan Srinivasan Doctor of Philosophy , 2011
Title of dissertation: SCALABLE LEARNING FOR GEOSTATISTICS AND SPEAKER RECOGNITION Balaji Vasan Srinivasan Doctor of Philosophy, 2011 Thesis directed by: Professor Ramani Duraiswami Department of Computer Science With improved data acquisition methods, the amount of data that is being collected has increased several fold. One of the objectives in data collection is to learn useful underlying pa...
متن کاملScalable learning for geostatistics and speaker recognition
With improved data acquisition methods, the amount of data that is being collected has increased several fold. One of the objectives in data collection is to learn useful underlying patterns. In order to work with data at this scale, the methods not only need to be effective with the underlying data, but also have to be scalable to handle larger data collections. My research focused on developi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010